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I. Introduction 
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Transmitter channel state information (CSIT) is crucial for the multiplexing gains offered by advanced interference management 
techniques such as multiuser MIMO and interference alignment. Such CSIT is usually obtained by feedback from the receivers, 
but the feedback is subject to delays. The usual approach is to use the fed back information to predict the current channel state and 
then apply a scheme designed assuming perfect CSIT. When the feedback delay is large compared to the channel coherence time, 
such a prediction approach completely fails to achieve any multiplexing gain. In this paper, we show that even in this case, the 
(N. completely stale CSI is still very useful. More concretely, we show that in a MIMO broadcast channel with K transmit antennas 

and K receivers each with 1 receive antenna, rrT~r TTt (> 1) degrees of freedom is achievable even when the fed back channel 
state is completely independent of the current channel state. Moreover, we establish that if all receivers have independent and 
identically distributed channels, then this is the optimal number of degrees of freedom achievable. In the optimal scheme, the 
transmitter uses the fed back CSI to learn the side information that the receivers receive from previous transmissions rather than 
to predict the current channel state. Our result can be viewed as the first example of feedback providing a degree-of-freedom gain 
in memoryless channels. 
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In wireless communication, transmitter knowledge of the channel state information (CSIT) can be very important. While in 
point-to-point channels CSIT only provides power gains via waterfilling, in multiuser channels it can also provide multiplexing 
gains. For example, in a MIMO broadcast channel, CSIT can be used to send information along multiple beams to different 
. receivers simultaneously. In interference channels, CSIT can be used to align the interference from multiple receivers to reduce 
I \ the aggregate interference footprint JT], 0. 

In practice, it is not easy to achieve the theoretical gains of these techniques. In the high SNR regime, where the multiplexing 
■ gain offered by these techniques is particularly significant, the performance of these techniques is very sensitive to inaccuracies 
^ ■ of the CSIT. However, it is hard to obtain accurate CSIT. This is particularly so in FDD (frequency-division duplex) systems, 
where the channel state has to be measured at the receiver and fed back to the transmitter. This feedback process leads to two 
sources of inaccuracies: 

> Quantization Error: The limited rate of the feedback channel restricts the accuracy of the CSI at the transmitter. 

• Delay: There is a delay between the time the channel state is measured at the receiver and the time when the information 
is used at the transmitter. The delay comes from the fact that the receivers need some time to receive pilots, estimate 
CSI, and then feed it back to the transmitter in a relatively long coding block. In time-varying wireless channels, when 
the channel information arrives at the transmitter, the channel state has already changed. 
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Much work in the literature has focused on the first issue. The general conclusion is that the rate of the feedback channel 
needed to achieve the perfect CSIT multiplexing gain scales well with the SNR. For example, for the MIMO broadcast channel, 
it was shown in [3| that the rate of feedback should scale linearly with log 2 SNR. Since the capacity of the MIMO broadcast 
channel also scales linearly with log 2 SNR, this result says that the overhead from feedback will not overwhelm the capacity 
gains. 

We now focus on the second issue, the issue of feedback delay. The standard approach of dealing with feedback delay is 
to exploit the time correlation of the channel to predict the current channel state from the delayed measurements j4j. The 
predicted channel state is then used in place of the true channel state in a scheme designed assuming perfect CSIT is available. 
However, as the coherence time of the channel becomes shorter compared to the feedback delay, due to higher mobility for 
example, the delayed feedback information reveals no information about the current state, and a prediction-based scheme can 
offer no multiplexing gain. 

In this paper, we raise the question: is this a fundamental limitation imposed by feedback delay, or is this just a limitation 
of the prediction-based approach? In other words, is there another way to use the delayed feedback information to achieve 
non-trivial multiplexing gains? We answer the question in the affirmative. 

For concreteness, we focus on a channel which has received significant attention in recent years: the MIMO broadcast 
channel. In particular, we focus on a system where the transmitter has M antennas and there are K receivers each with 
a single receive antenna. The transmitter wants to send an independent data stream to each receiver. To model completely 
outdated CSI, we allow the channel state to be independent from one symbol time to the next, and the channel state information 
is available to both the transmitter and the receivers one symbol time later. This means that by the time the feedback reaches 
the transmitter, the current channel is already completely different. We also assume that the overall M-hy-K channel matrix 
is full rank at each time. 

Our main result is that, for M > K, one can achieve a total of 

K 

1+1 + .. • + £ 

degrees of freedom per second per Hz in this channel. In other words, we can achieve a sum rate that scales like: 

K 

j -log 2 SNR + o(log 2 SNR) bits/s/Hz 

1 + 2 + • • ' + K 

as the SNR grows. Moreover, we show that under the further assumption that all receivers have independent and identically 

distributed channels, this is the optimal number of degrees of freedom achievable. 

It is instructive to compare this result with the case when there is no CSIT and the case when there is perfect CSIT. While 

the capacity or even the number of degrees of freedom is unknown for general channel statistics when there is no CSIT, in 

the case when all receivers have identically distributed channels, it is easy to see that the total number of degrees of freedom 

is only 1, Since rrxjr XX > 1 f° r an y K — 2> we see that, at least in that case, there is a multiplexing gain achieved by 
i+ 2 +...+ K 

exploiting completely outdated CSI. However, the multiplexing gain is not as good as K, the number of degrees of freedom 
achieved in the perfect CSIT case. On the other hand, when K is large, 

K K 
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almost linear in K. 

Why is outdated CSIT useful? When there is perfect CSIT, information intended for a receiver can be transmitted to that 
receiver without other receivers overhearing it (say by using a zero-forcing precoder), so that there is no cross-interference. 
When the transmitter does not know the current channel state, this cannot be done and information intended for a receiver 
will be overheard by other receivers. This overheard side information is in the form of a linear combination of data symbols, 
the coefficients of which are the channel gains at the time of the transmission. Without CSIT at all, this side information 
will be wasted since the transmitter does not know what the coefficients are and hence does not know what side information 
was received in previous transmissions. With outdated CSIT, however, the transmitter can exploit the side information already 
received at the various receivers to create future transmissions which are simultaneously useful for more than one receiver 
and can therefore be efficiently transmitted. Note that there is no such overheard side-information in simpler scenarios such 
as point-to-point and multiple access channels, where there is only a single receiver. Indeed, it is shown in Q, [6] that for 



such channels, the only role of delayed CSIT is to predict the current state, and when the delayed CSIT is independent of the 
current state, the delayed CSIT provides no capacity gains. 

The rest of the paper is structured as follows. In Section [II] the problem is formulated and the main results are stated 
precisely. Sections Hill IVII and IVHI describe the proposed schemes, and Section HVl describes the converse. In Section [V] the 
DoF region for the case of M = K is characterized. The connection between our results and those for the packet erasure 
broadcast channel is explained in Section IVIIII Some follow-up results to the conference version of this paper are discussed 
in |IX] We conclude with a discussion of our result in the broader context of the role of feedback in communication in Section 

m 

II. Problem Formulation and Main Results 

We consider a complex baseband broadcast channel with M transmit antennas and K receivers, each equipped with a single 
antenna. In a flat fading environment, this channel can be modeled as, 

y r [n] = h£[n]x[n] + z r [n], r = 1,...,K, (1) 

where f denotes transpose-conjugate operation, x[n] e C Mxl , E[x^[n]x[rt]] < SNR, z r [n] ~ CA/"(0, 1) and the sequences 
z r [n]'s are i.i.d. and mutually independent. In addition, h£[n] = [hl^n], . . . , hl M [n)} 6 C lxM . We define H[n] as H[n] = 
[h.i[n], . . . ,h K [n}]. 

We assume that H[n] is available at the transmitter and all receivers with one unit delay 

Let us define £ as £ = {1, 2, . . . , K}. We assume that for any subset S of the receivers, S C £, the transmitter has a 
message Ws with rate R$ bits/s/Hz. For example, message ^{1.2} is a common message for receivers one and two. Similarly, 
W{!}, or simply W\, is a message for receiver one. We define ds, as 

ds = lim f - — . (2) 

SNR^oo log 2 SNR 

If |>S| = j, then we call Ws an order- j message or a message of order j. We define degrees of freedom order j, DoF* (M, K), 

as 

DoF*(M,iT) = lim max V — ^— . (3) 
3K ' snr-kx) Rec ^ log, SNR 

s,\s\=j 62 

where C denotes the capacity region of the channel, and R g R' 2 _1 ) x1 denotes the vector of the message rates for each 
subset of receivers. We note that DoF* (M, K) is the well-known notion of the degrees of freedom of the channel. 
In this paper, we establish the following results. 

Theorem 1 As long as H[n] is full rank almost surely for each n, and {H[n]} is stationary and ergodic, then for M > K, 

DoFt(M,K)> * (4) 

' 2 K 

More generally, as long as M > K — j + 1, then 

K - 7 + 1 1 

DoF* M, K) > J+ (5) 

For example, DoFj(2, 2) > | and DoF^(3, 3) > j|, which are greater than one. Note that this achievability result holds under 
very weak assumptions about the channel statistics. Hence, even when {H[?i]} is an i.i.d. process over time, delayed CSIT is 
still useful in achieving a degree-of-freedom gain. 

The following theorem gives a tight converse under specific assumptions on the channel process. 



1 All our achievable results hold regardless of what the delay is, since they do not depend on the temporal statistics of the channel. Hence, for convenience, 
we will just normalize the delay to be 1 symbol time. 



Theorem 2 If the channel matrices {H[n]} is an i.i.d. process over time and the channels are also independent and identically 
distributed across the receivers, then 

DoF* (M,K)< y ' J ' j-j- (6) 

min{l,M} min{2,M} mm{A'-j + l,A/} 

The equality between the expressions in © and © in the case of M > K — j + 1 can be verified using the identity ( l47l ). 
proved in Appendix [A] thus yielding the following corollary. 

Corollary 1 If the channel matrices {H[n]} is an i.i.d. process over time and is also independent and identically distributed 
across the receivers, then the lower bounds in Theorem [7J are tight. 

In addition, the region of order-one DoF for the case M = K is characterized as follows: 

Theorem 3 If the channel matrices {H[n]} is an i.i.d. process over time and is also independent and identically distributed 
across the receivers, then the DoF region for the case M = K is characterized as all positive K-tuples (d%, . . . , dx) 
satisfying: 

K 



< 1 (7) 



for all permutations 7r of the set {!,..., K}. 



The achievability result for Theorem Q] holds for M > K — j + 1. We have the following achievability result for general 
M,K and j. 

Theorem 4 Assume that H[n] is full rank almost surely for each n, and {H[n]} is stationary and ergodic. If DoFj + i(Af, K) 
is achievable for order-(j + 1) symbols, then DoF j(M, K) is achievable for order-j symbols, where 

DoF 3 (M,K)= 1 q . J 1 - , (8) 

j + j+l DoF j + 1 (M,K) 

and Qj — min{Af — 1, K — j}. 

Starting from DoF* K (M, K) = 1, which is simply achievable, one can use iterative equation ([H) to derive an achievable 
DoF j(M, K) with the following closed form 

DoF j(M,K)= (9) 

M 
j 



s-^K-M 1 ( M-i y-3 , ( M-\\K-M-j+\ n ' 

2si=j i \ M J ~t~ V M I K2si=K-M+l i I 

for the case of M < K — j + 1. Unlike the case of M > K — j + 1, however, the expression in (0 does not match the 
upper bound in Theorem [2] In particular, this means that Theorem [4] does not allow us to characterize the degrees of freedom 
DoF^(M, K) when the number of users K is greater than the number of transmit antennas M. On the other hand, it is easy to 
verify that the achievable DoFi(A/, K) in Theorem|4]is increasing with K, even when K > M. Therefore, unlike the situation 
with full CSIT, the degrees of freedom under delayed CSIT is not determined by the minimum of the number of transmit 
antennas and the number of receivers. 

For the special case of M = 2 and K = 3, we obtain an exact characterization of the degrees of freedom. 

Theorem 5 Assume that H[n] is full rank almost surely for each n, and {H[n]} is stationary and ergodic, then DoF*(2,3) = I. 



III. Achievable Scheme for TheoremQ] 

In this section, we explain the achievable scheme for Theorem Q] The key is to understand the square case when M 
For simplicity, we start with the cases M = K = 2 and M = K = 3. 



= K. 



A. Achievable Scheme for M = K = 2 

In this subsection, we show that for the case of M = K = 2, the DoF of | is achievable. We explain the achievable scheme 
from three different perspectives: 

1) Exploiting Side-Information 

2) Generating Higher-Order Messages 

3) Interference Alignment using Outdated CSIT 

For notational clarity, in this subsection we will use A and B to denote the two receivers instead of 1 and 2. 
1 ) Exploiting Side-Information: Let u r and v r be symbols from two independently encoded Gaussian codewords intended 
for receiver r. The proposed communication scheme is performed in two phases, which take three time-slots in total: 
Phase One - Feeding the Receivers: This phase has two time-slots. 

The first time slot is dedicated to receiver A. The transmitter sends the two symbols, ua and va, intended for receiver A, 
i.e. 



x[l] = 



At the receivers, we have: 



= h A1 [l]u A + h' A2 [l]vA + za[1], 
y B [l] = h B1 [l]uA + h B2 [l]vA + 2s [1]. 



(10) 



(ID 
(12) 



Both receivers A and B receive noisy versions of linear combinations of ua and ub- Receiver B saves the overheard equation 
for later usage, although it only carries information intended for receiver A. 

The second time-slot of phase one is dedicated to the second receiver. In this time-slot, the transmitter sends symbols 
intended for receiver B, i.e. 



c[2] = 



ub 
vb 



At receivers, we have: 



y A [2] = h A1 [2]u B + h A2 [2]v B + z A [2], 
y B [2] = h Bl [2]u B + h B2 [2]v B + z B [2]. 



(13) 



(14) 
(15) 



Receiver A saves the overheard equation for future usage, although it only carries information intended for receiver B. 
Let us define short hand notations 



L\{ua,va) 
L 2 (ua,va) 
L 3 (u B ,v B ) 
L 4 (ub,vb) 



= tf A1 [i\v,A- 

= h Bll l \ u A 

= h A1 [2}u B - 

= h B1 [2]u B 



h A2 [l]v A , 
h B2 [l]v A , 
h A2 [2]v B , 



^B2 



[2]v B - 



The transmission scheme is summarized in Fig. Q] In this figure, for simplicity, we drop the thermal noise from the received 
signals. We note that, assuming H[l] is full rank, there is a one-to-one map between (ua, va) and (Li(ua, va), L 2 (ua,va))- If 
receiver A has the equation overheard by receiver B, i.e. L 2 (ua, va), then it has enough equations to solve for its own symbols 
ua, and va- Similarly, assuming H[2] is full rank, there is a one-to-one map between (ub,vb) and (L^(ub, vb), L^ub, vb))- 
If receiver B has the equation overheard by receiver A, i.e. L^(ub,vb), then it has enough equations to solve for its own 
symbols ub, and vb- 

Therefore, the main mission of the second phase is to swap these two overheard equations through the transmitter. 

Phase Two - Swapping Overheard Equations: This phase takes only one time-slot at n — 3. At this time, the transmitter 
sends a linear combination of the overheard equations, i.e. L 2 (ua, va) and L^(ub,vb)- We note that at this time the transmitter 
is aware of the CSI at n = 1 and n = 2; therefore it can form the overheard equations L 2 (ua, va) and L^(ub,vb)- 



m = 1 m = 2 



L 2 (ua, V A ) + / 

^2 






m = 1 m = 2 



Y L 2 (u A ,v A ) 



h A1 [3] (L 2 (u A ,v A ) + L 3 (u B ,v B )) 



) + L 



Fig. 1. Achievable Scheme for M = K = 2 



For example, x[3] can be formed as, 



x[3] 



L 2 (ua, v A ) + L 3 (u B ,v B ) 




At receivers, we have, 



Va[3] = h A1 [3) (L 2 (ua,va) + L 3 (ub,vb)) + za[3], 
y B [3] = h' m [3] (L 2 (ua,v a ) + L 3 (u B ,v B )) + z B [3]. 



(16) 



(17) 
(18) 



Remember that receiver A already has (a noisy version of) L 3 (u B , v B ). Thus, together with 2m[3], it can solve for its two 
symbols ua,va- We have a similar situation for receiver B. 

Remark: In this scheme, we assume that in the first time-slot, transmit antenna one sends ua and transmit antenna two 
sends va- However, antenna one and two can send any random linear combination of ua and va- Therefore, for example, we 
can have 



x[l] = A[l] 



UA 
VA 



(19) 



where A[l] € C 2x2 is a randomly selected matrix. Similar statement is true for the second time-slot. At time-slot n = 3, we 
send L 3 (u B ,v B ) + L 2 (ua,va)- However, we can send any combination of L 3 (u B ,v B ) and L 2 (ua,va)- In other words, 



x[3] = A[3] 



(20) 



L 3 (u B ,v B ) 
L 2 (u Al v A ) 

where A [3] G C 2x2 is a randomly selected matrix. However, we can limit the choice of A [3] to rank one matrices. 

Remark: We note that only the number of independent noisy equations that each receiver has is important. As long as the 
variance of the noise of each equation is bounded, the DoF is not affected. Therefore, in what follows, we ignore noise and 
just focus on the number of independent equations available at each receiver. 

Remark: Note that if the transmitter has 2N transmit antennas, and each of the receivers has N antennas, then we can 
follow the same scheme and achieve DoF of ^j-. 

2) Generating Higher Order Symbols: We can observe the achievable scheme from another perspective. Remember in 
the second phase, we send a linear combination of L 2 (ua,va) and L 3 (u B ,v B ), e.g. L 2 (ua,va) + L 3 (u B ,v B ), to both 
receivers. We can consider L 2 (ua,va) + L 3 (u b ,v b ) as an order-two common symbol, required by both receivers. Let us 
define uab = L 2 (ua, va)+L 3 (u b ,v b ). If we have an algorithm which achieves the degrees of freedom of D0F2 for order-two 
common symbols, then we need DoF ^ 2 2) time-slots to deliver the common symbol ua B to both receivers. Therefore, in total, 
we need 2 + Dof ^ 2 2) t0 deliver four symbols ua, va, u b , and v B to the designated receivers. Thus, we have, 

4 



DoF 1 (2,2) = 



(21) 



It is easy to see that we can achieve DoF2(2,2) 
DoFi(2,2) of I is achievable. 



* DoF 2 (2 : 2) 

1 by simply sending uas to both receivers in one time-slot. Therefore, 



" va[i] ' 




I/a [2] 




_ Va[3] _ 





h A1 [3]h^} 



Rank Two 



UA 
VA 



2M[3] 



Va[1] 

-h A1 [3}y A [2] 





h\ x {2] 

h A1 [3]h A1 [2] h^h^p} 



Rank One 



L A2 

h A1 [3}h? m [l] /4[3]/4[2] 





UA 


+ 




VA 





z A [3] 



u B 

VB 



*a[1] 
za[3] 



*a[1] 



(22) 



(23) 



In summary, phase one takes as input two order-one symbols for each receiver. It takes two time-slots to deliver one desired 
equation to each of the receivers. Therefore, each receiver needs one more equation to resolve the desired symbols. If the 
transmitter ignores the overheard equations, we need two more time-slots to deliver one more equation to each receiver and 
yield the DoF of one. However, by exploiting the overheard equations, we can form a common symbol of order two. Delivering 
one common symbol of order two to both receivers takes only one time-slot but it simultaneously provides one useful equation 
to each of the receivers. Therefore using this scheme, we save one time-slot and achieve DoFi(2,2) = | rather than |. 

3) Interference Alignment using Outdated CSIT: Putting together the symbols received by receiver A over the three time- 
slots, we have (l22l . From (l22l . it is easy to see that at receiver A, the two interference streams ub and vb arrived from the 
same directions [0, /ijii[2], /i^i^/iai^]]^, and therefore ub and vb are aligned. Note that the alignment is done using outdated 
CSIT. By making the interference data symbols aligned at receiver A, the two symbols ub and vb collapse into one symbol 
h A1 [2]uB + h A2 [2)vB- Eliminating the variable h A1 [2]uB + h A2 [2)vB from ( f22l . we have (l23l , which is an equation set of the 
two desired symbols ua and va- It is easy to see that as long as ^.^[3] ^ and /i^-jlj/i^^l] — /^[lj/i^-jl] ^ 0, then the 
desired data symbols are not aligned at receiver A and they can be solved for. We note that at /^[l^^ll] — ^-^M^biM 
is the determinant of the channel matrix H[l]. Indeed, in this scheme, receiver A borrows the antenna of the second receiver 
at time-slot n = 1 to be able to solve for the two symbols. 

B. Achievable Scheme for M = K = 3 

In this section, we show how we achieve DoF of , ,i , i = tt f° r the channel with a three-antenna transmitter and three 
single-antenna receivers. As explained in the previous subsection, we can observe the achievable scheme from three different 
perspectives. However, we find the second perspective simpler to follow. Therefore, in the rest of the paper, we just explain 
the algorithm based on the second perspective. 

The achievable scheme has three phases. Phase one takes order-one symbols and generates order-two common symbols. 
Phase two takes order-two common symbols and generates order-three common symbols. The last phase takes order three- 
common symbols and deliver them to all three receivers. 

Phase One: This phase is similar to phase one for the 2 by 2 case. It takes three independent symbols for each receiver and 
generates three symbols of order two. Assume that u r , v r , and w r represent three symbols, independently Gaussian encoded, 
for receiver r, r = A, B, C. Therefore, in total, there are 9 data symbols. This phase has three time-slots, where each time-slot 
is dedicated to one of the receivers. In the time-slot dedicated to receiver A, the transmitter sends random linear combinations 
of ua, v A , and w A over the three antennas. Similarly, in the time-slot dedicated to receiver B, the transmitter sends random 
linear combinations of ub, vb, and wb over the three antennas. In the time-slot dedicated to receiver C, the transmitter sends 
random linear combinations of uc, vc, and wc over the three antennas. Refer to Fig. [2] for details. 

So far the algorithm has taken three time-slots and delivered three desired equations to the designated receivers. Therefore, 
in terms of counting the desired equations, the algorithm delivers one equation per time-slot which is natural progress for a 
system without CSIT. If we ignore the overheard equations, then we need six more time-slots to successfully deliver the 9 
data streams, which yields the DoF of one. However, as described in the 2 by 2 case, the overheard equations can help us to 
improve the degrees of freedom. 

Let us focus on the time-slot dedicated to receiver A. Then, we have the following observations: 

• The three equations Li(u A , v A , u>a), L2(ua,v a ,vj a ), and L 3 (u A , va, wa) form three linearly independent equations of 
ua, v A , and w A , almost surely. 





in — 2 






■ 
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■ 






m — 1 m — 2 m — 3 

^ A Li(l(A, t'_4, ) L-it/Ue, fB, 10 C ) Lriuc, vc, wc) 



^ *■ L>i{va,va, hi a) Ls(ub,vb,wb) Ls(uc, vc, wc) 



¥ *" Ls(ua, va, wa) L%{ub, vb, wb) L<)(uc,vc,wc) 



Fig. 2. Achievable Scheme for K = 3: Phase One 

• If we somehow deliver the overheard equations L2(ua, va, wa) and L^(ua, va, Wa) to receiver A, then it has enough 
equations to solve for ua, va, and wa- 

• The two overheard equations L2(ua, va, wa) and L^(ua, va, wa) plus the equation received by receiver A i.e. L\{ua, va, wa), 
fully represent the original data symbols. Therefore, sufficient information to solve for the data symbols is already available 

at the receivers, but not exactly at the desired receiver. 
We have similar observations about the equations received in the time-slots dedicated to receivers B and C. Remember 
that originally the objective was to deliver u r , v r , and w r to receiver r. After these three transmissions, we can redefine the 
objective. The new objective is to deliver: 

• (i) the overheard equations L2(ua, va, wa) and Ls(ua, va, wa) to receiver A, 

• (ii) the overheard equations L4(ub,vb,wb) and Lq(ub,vb,wb) to receiver B, and 

• (iii) the overheard equations Lj{uc,vc,wc) and L s (uc,vc,wc) to receiver C. 




Li2{uab,vab) 




L\z,{UAC,VAc) 













Fig. 3. Achievable Scheme for K = 3: Phase Two 



Let us define uab as a random linear combination of L2(ua, va, wa) and L^(ub,vb,wb)- To be specific, let uab = 
L2{ua,va,wa) + L4{ub,vb,wb)- Then we have the following observations: 

• If receiver A has uab, then it can use the saved overheard equation L4(ub,vb,wb) to obtain ^("A, va, wa)- Remember 
£2(11 a, va, wa) is a desired equation for receiver A. 

• If receiver B has uab, then it can used the saved overheard equation L2 (ua , va , wa ) to obtain L^ubtVb, wb ) ■ Remember 
Liius, vb, wb) is a desired equation for receiver B. 

Therefore, uab is desired by both receivers A and B. Similarly, we define uac = L^(ua, va, wa) + Lt(uc, vc, wc), which 
is desired by receivers A and C, and define ubc — Lq(ub, vb, wb) + Ls(uc, vc, wc), which is desired by receivers B and 
C. We note that if receiver A has uab and uac, then it has enough equations to solve the original data symbols ua, va, and 
wa- Similarly, it is enough that receiver B has uab and use, and receiver C has uac and ubc- Therefore, again, we can 
redefine the objective as delivering uab to receivers A and B, uac to receivers A and C, and ubc to receivers B and C. 
Suppose now we have an algorithm that can achieve DoF 2 (3, 3) degrees of freedom for order-two common symbols. Then, 
the total time to deliver the original 9 data symbols is the initial three time-slots of sending linear combinations of the 9 



symbols plus 



DoF 2 (3,3) 



time-slots to deliver the three order-two symbols generated. Therefore, the overall DoF to send the 



order-1 symbols is given by 

DoFi(3,3) = — 9 3 . (24) 

6 + DoF 2 (3 : 3) 

It is trivially easy to achieve DoF2(3, 3) = 1, which yields DoFi(3, 3) of 2, However, as we will elaborate in the following, 
we can do better. 

Phase Two: Phase one of the algorithm takes order-one symbols and generates order-two symbols to be delivered. Phase 
two takes order-two symbols, and generates order-three symbols. Phases two and three together can also be viewed as an 
algorithm which delivers order- two common symbols. 

Assume that uab and vab represent two symbols that are desired by both receivers A and B. Similarly, uac and vac are 
required by both receivers A and C, and ubc and vbc are required by both receivers B and C. Therefore, in total, there are 
6 order-two symbols. We notice that phase one generates only three order-two symbols. To provide 6 order-two symbols, we 
can simply repeat phase one twice with new input symbols. Phase two takes three time-slots, where each time-slot is dedicated 
to one pair of the receivers. In the time-slot dedicated to receivers A and B, the transmitter sends random linear combinations 
Uab and vab from two of the transmit antennas. We have analogous transmissions in the other two time-slots. For details, 
see Fig. [3] 

In Fig. |2 we focus on the first time-slot dedicated to both users A and B. Then, we have the following important 
observations: 

• Li (uabi vab) and L 12 {uab,vab) form two linearly independent equations of uab and vab, almost surely. 

• Similarly, Lh(uab, vab) and L 12 (uab,vab) form two linearly independent equations of uab and vab, almost surely. 

• If L\ 2 {uab, vab) is somehow delivered to both receivers A and B, then both receivers have enough equations to solve 
for uab and vab- Therefore, L 12 {uab, v ab), which is overheard and saved by receiver C, is simultaneously useful for 
receivers A and B. 

We have similar observations about the received equations in the other two time-slots. Therefore, after these three time-slots, 
we can redefine the objective of the rest of the algorithm as delivering 

• (i) Li 2 (uab,vab) to receivers A and B, 

• (ii) Li4(uac, v ac) t° receivers A and C, and 

• (hi) L le (uBC,UBc) to receivers B and C. 

Let us define uabc and vabc as any two linearly independent combinations of Li 2 (uab,vab) and L\^(uac, uac), and 
Li 6 (u B c,ubc)- 

UABC = OtiLi 2 {u A B, Vab) + Ot 2 Li A (uAC, vac) 

+ a 3 Li 6 (u B c, vbc), 
vabc = PiL\ 2 (uab, vab) + faLu(uAC, vac) 

+ PzLiq(ubc, vbc), 

where the constants on and (3i, i = 1,2, 3, have been shared with receivers. If we somehow deliver uabc and vabc to receiver 
A, then together with its saved overheard equation L\q(ubc ,vbc), receiver A has 3 linearly independent equations to solve 
for L\ 2 (uab, vab) and L\^(uac, v ac)- Then, it has enough equations to solve for uab, vab, uac, and vac- We have the 
similar situation for receivers B and C. Therefore, it is enough to deliver uabc and vabc to all three receivers. If we have 
an algorithm that can provide DoF3(3, 3) degrees of freedom to deliver order-three common symbols, then the total time to 
deliver the original 6 order-two common symbols is 3 + DoF3 2 ( 3 3) > taking into account the first three transmissions (described 
in Fig. |3). Therefore, we have 

DoF 2 (3,3) = - ■ 6 2 . (25) 

6 + DoF 3 (3 : 3) 

Phase Three: Phase Three transmits order-three common symbols. This phase is very simple. Assume that uabc is required 
by all three receivers. Then, the transmitter can use only one transmit antenna and send uabc- All three receivers will receive 
a noisy version of uabc- Therefore, we use one time-slot to send one order-three symbol. Therefore, DoF3(3, 3) = 1. Then, 
from (ED and ([25), we conclude that DoFi(3,3) = if and DoF 2 (3,3) = §. 



C. General Proof of Achievability for Theorem]!] 

In this section, we explain the achievable scheme for the general case in Theorem [TJ 

First we focus on the general M = K square case. The algorithm is based on a concatenation of K phases. Phase j takes 
symbols of order j and generates symbols of order j + 1. For j = K, the phase is simple and generates no more symbols. For 
each j, we can also view phases j, j + 1, . . . K together, as an algorithm whose job is to deliver common symbols of order j 
to the receivers. 

The phase takes (K-j + 1) ( K ) common symbols of order j, and yields .7 ( symbols of order j + 1. This phase has 
(■y) time-slots, with each time-slot dedicated to a subset S of receivers, \S\ — j. We denote the time-slot dedicated to the subset 
S by tg. In this time-slot, the transmitter sends random linear combinations of the K—j+1 symbols us,i,tis,2, ■ ■ ■ , us,K-j+it 
desired by all the receivers in S. The transmitter utilizes K — j + 1 of the transmit antennas. 

The linear combination of the transmitted symbols received by receiver r is denoted by Ls, r - Let us focus on the linear 
combinations of the transmitted symbols received by all receivers, in time-slot t$- We have the following observations: 

• For every r G 6>, the K — j + 1 equations consisting of one equation Lg r and the K — j overheard equations: {Ls, r ' ■ 
r' G £\S} are linearly independent equations of the K — j + 1 symbols Us,i, us,2) ■ ■ ■ , Us,K-j+i- This relies on the fact 
that the transmitter uses K — j + 1 transmit antennas. 

• For any r, r g S, if we somehow deliver the K — j equations {Lg.r 1 ■ r' G £\£>} to receiver r, then receiver r has 
K—j + 1 linearly independent equations to solve for all K — j + 1 symbols Us i, 1*5,2, ■ ■ ■ , us.K-j+i- 

• Having the above two observations, we can say that the overheard equation by receiver r', r' G £\S, is simultaneously 
useful for all receivers in S. 

After repeating the above transmission for all S, where S C £ and |5| = j, then we have another important observation. 
Consider any subset T of receivers, where |T| = j + 1. Then each receiver r, r € T, has an overheard equation Lq-\^ r , 
which is simultaneously useful for all the receivers in T\{r}. We note that the transmitter is aware of these overheard 
equations. For every T C £, |T| = j + 1, the transmitter forms j random linear combinations of L-j-\{ r } ,n r e T, denoted by 
wr,ij u T.2i ■ ■ ■ , u T,j- We note that 1 < £ < j, is simultaneously useful for all receivers in T. Indeed, each receiver r in T 
can subtract the contribution of L-j-\{ r y r from uj- 5, £ = 1, . . . , j, and form j linearly independent combinations of £7-^,,} ,., 

(A) 



r G T\{r}. Using the above procedure, the transmitter generates j( symbols of order j + 1. The important observation is 



that if these j'( +1 ) symbols are delivered to the designated receivers, then each receiver will have enough equations to solve for 

all of the original common symbols of order j. Delivering j'Q-^jJ order-(j+ 1) symbols takes DoF ~M K ^ using an algorithm 
that provides DoFj+i(K, K) degrees of freedom for order-(j + 1) symbols. Since the phase starts with (K — j + 1)(^) 
symbols of order j, and takes (^) time-slots, and generates j( symbols with order j + 1, we have 

(K -j + l)( K ) 

DoF, (K, K) = ,' K \ J , (26) 



3) DoF J + i(K,K) 

or 

K-j + 1 1 1 K - j 



j DoFj (K, K) j 3 + l DoF J+1 (K, K) ' 
It is also easy to see that DoF^(_ftr) = 1 is achievable. Solving the recursive equation, we have 



(27) 



K-j + 1 1 

DoF, (K, K) = ■!— - x j . (28) 



3 J+l K 



In particular, 



DoF 1 (K,K)= — — y— — -r • (29) 
J- ~r o ~r • • • ~r t? 



2 1 ■ • ' 1 K 

Therefore the achievablity of Theorem [TJ in the square case has been established. 

Now observe that in the above algorithm, phase j only requires the use of K — j + 1 transmit antennas, not all K of the 
transmit antennas. Moreover, common symbols of order j are delivered using phases j, j + 1, . . . , K. Hence, we conclude that 
the degree of freedom of order-j messages achieved above in the square system can actually be achieved in a system with 
less transmit antennas as long as M > K — j + 1. This proves Theorem [TJ in the rectangular case as well. 



Remark: We note that if the transmitter has KM transmit antennas, and each of the K receivers has N receive antennas, 
then the D0F1 of 

KN 



1 + 1 + .. . + i 



is achievable. More generally, in this channel, for order- j symbols, the DoFj of 

K-j+1 N 



7 i + + -i- 



is achievable. 



D. Implementation Issues 

For simplicity, the proposed scheme has been presented in a symbol-by-symbol based format. However, this scheme can be 
implemented in a block-by-block fashion as well. This would allow us to exploit the coherence of the channel over time and 
frequency to reduce channel training and feedback overhead. To be specific, let us again focus on the case of M = K = 2. 

Consider a block of time-frequency resources, consecutive in time and frequency. Let us assume that in the first phase of 
the scheme, we dedicate half of these resources to receiver A and the other half to receiver B. To start the second phase, 
the transmitter needs to know channel coefficients during the first phase. For example, if the lengths of the block in time 
and frequency are respectively less than coherent time and bandwidth of the channel, then during the first phase the channel 
coefficients are (almost) constant. Therefore, to start the second phase, the transmitter needs only to know the four channel 
coefficients. Let us denote the coherent time and bandwidth by T c and W c respectively. Then, for each T C W C time-frequency 
resources, the transmitter needs to dedicate at least two time-frequency resources to send orthogonal pilot signals and learn 
four coefficients through feedback. Then, the transmitter uses the remaining resources to send 2T C W C — 2 order-one symbols. 
Remember that the transmitter is also required to report the channel coefficients of each receiver to the other receiver. Since 
each receiver knows its own channel state information, the transmitter can exploit that and send to both receivers the two 
symbols of + /i.bi[1] and /^[l] + ^S2[l]> as the symbols of order-two in the second phase. Therefore, the second 

phase takes c A c ~ + 2 resource units for order-two messages. Following the above argument, the scheme can achieve DoF 
°f 1t"w ^ +0 5 ' ^ T C W C S> 1, as in most wireless channels, then the degree of freedom is close to 4/3. 

IV. Outer-Bound 

In this section, we aim to prove Theorem |2] In this theorem, we focus on the degrees of freedom of the channel for order- j 
messages. Therefore, we assume for every subset S with cardinality j of receivers, the transmitter has a message Ws, with 
rate Rs and degrees of freedom d$. 

Remember in Section [II] we assume that the channel state information is available to all nodes with one time-unit delay. 
As an outer-bound, we consider the capacity of a channel in which the channel state information at time n is available to all 
receivers instantaneously at time n. Therefore, at time n, receiver r has (y r [t), H[t]), t = 1, . . . , m, for any r, 1 < r < K. On 
the other hand, the transmitter has not only the channel state information, but also received signals, both with one unit delay. 
Therefore, at time n, the transmitter has (yi[t], . . . ,yK[i\,H[i\), t = 1, ... ,m — 1. Now, we improve the resultant channel 
even further as follows. 

Consider a permutation tt of the set £ = {1,2,..., K}. We form a if -receiver broadcast channel, by giving the output 
of the receiver to the receivers j = i + 1, . . . ,K, for all i = 1, . . . , K — 1. Therefore, we have an upgraded 

broadcast channel, referred to as improved channel with K receivers as (y^i) [ n ]i H[n]), [n], 2/^(2) [n], H[n]J, . . ., 

(2/7r(i)Mi2/7r(2)[ n ]i ■ • ■ i y^(K)[ n ]i H[n]). We denote the capacity of the resultant channel as Ci mprove d(7r). Denoting the capacity 
of the original channel with C, we obviously have C C Ci mprove( i(7r). Moreover, it is easy to see that the improved channel is 
physically degraded. 

In the improved channel, consider message Ws, which is required by all j receivers listed in S. Let i* be the smallest 
integer where 7r(i*) G S. Then, due to the degradedness of the channel, if Ws is decoded by receiver then it can be 

decoded by all other receivers in S. Therefore, we can assume that Ws is just required by receiver Using this argument, 

we can simplify the messages requirements from order-j common messages to pure private messages as follows: receiver 7r(l) 



requires all messages Ws, where 7r(l) E S and S G £. Similarly, receiver 7r(2) requires all messages Ws, where 7r(2) 6 S 
and 5 C £\{tt(1)}. We follow the same argument for all receivers. 

According to Q, feedback does not improve the capacity of the physically degraded broadcast channels. Consequently, we 
focus on the capacity region of the improved channel without feedback, and with the new private message set. On the other 
hand, for broadcast channels without feedback, the capacity region is only a function of marginal distributions. Therefore, we 
can ignore the coupling between the receivers in the improved channel. Thus, we have a broadcast channel where receiver 
7r(j) has i antennas, and the distributions of the channels between the transmitter and any of the receive antennas are identical. 
Moreover receiver ir(i) is interested in all messages Ws, where ir(i) G S, \S\ = j, and S C £ \{ir(l), tt(2), . . . , 7r(i — 1)}. 

Therefore, according to JSJ, extended by 0, one can conclude that 

K-j+l 

E ■ J- M x E d <s<l. (30) 

' mm i.M 

i=l 1 ' 1 \s\= 3 

SC£\{7r(l),...7r(i-l)} 
7r(i)e5 

By applying the same procedure for any permutation of the set {1, 2, . . . , K} and then adding all of the K\ resulting 
inequalities, the theorem follows. 

V. The DoF region for K = M 

In this section, we prove Theorem [3] which characterizes the DoF region of the channel for the case M = K. 

We note that the region of Theorem [3] is the polyhedron proposed by the outer-bound (f30b for order-one messages where 
M = K. Here, we show by induction on K that the region is achievable. The hypothesis is clearly true for K = 1. Now assume 
that the hypothesis is true for K = 1, . . . , k — 1. Consider the case when K = k. First we argue that any point (di, di, . . . , dk) 
in the polyhedron such that di > for all i and di ^ dj for some i, j cannot be a corner point of the polyhedron. Without loss 
of generality, we can assume that the coordinates of such a point is ordered in a non-decreasing order, since the polyhedron 
is invariant to permutation of coordinates. Let i\, %2 be such that either < d\ = . . . = dt 1 < e^+i = . . . = di 2 < dj 2 +i, or 
< di = . . . = di 1 < dii+x = ■ . ■ = di 2 and = k. Now a direct calculation shows that tt is a permutation of {1, . . . , k} 
which maximizes: 

k J 
a 7r(i) 

i=l 1 

among all permutations if and only if d^^ > d^O) whenever i < j. This means that the only constraints, if any, of the 
polyhedron that [dx, . . . ,dk) satisfies with equality correspond to permutations satisfying 7r(j) € {1, for all j € 

{k — ii + 1, . . . , k} and 7r(j) 6 {i\ + 1, . . . , 1%} for all j G {k — i-x + 1, . . . , k — All other constraints are satisfied 
with strict inequality. We define vector (ei, . . . , e/c) as 

e, = =^s — for i = 1, . . . , h 

ei = k -il e r fori = u + l, ...,i 2 (31) 

ei — otherwise 
An explicit calculation shows that for any e > 0, both the point 

(Ji, . . . ,d k ) + {e\, . . . ,e fc ) 

and the point 

(dt,...,d k ) - (ei,...,e fc ) 

continue to satisfy the tight inequalities with equality. Moreover, for e sufficiently small, the constraints that are not tight on 
(di, . . . , dk) remain not tight on these 2 points. Hence, both these points lie in the polyhedron, and hence (di, . . . , dk), which 
is the average between these points, cannot be a corner point. 

Thus, the only point in the strict positive quadrant that can be a corner point of the polyhedron is the point: 

1 

— ; 1,1,. ..,1 . 

Ek i v ' ' ' ' 
i=i 7 



This point is achievable by Theorem Q] Any other point in the polyhedron is a convex combination of this point and points 
for which some of the coordinates are zero. Each one of these latter points is in fact in the polyhedron for some smaller value 
of K — k' < k. By the induction hypothesis, each of these points is achievable. Hence, by time-sharing, any point in the 
polyhedron for K — k is achievable. 

VI. Achievable Scheme for Theorem[4] 

In Section Hill we explained an algorithm to achieve DoF^(M, K), when M > K. More generally, we characterized 
DoF*(M, A"), when M > K — j + 1. In this section, we extend the optimal achievable scheme of Section ITlTl and develop a 
sub-optimal algorithm for the case that M < K — j + 1 for order- j messages. We first focus on the case M = 2 and K = 3. 

A. Achievable Scheme for M — 2, K = 3 

From Theorems [T] and |2] we have DoF2(2,3) = | and DoF3(2,3) = 1. However, for order-one messages, we only know 
from the outer-bound that DoF^(2,3) < |. On the other hand, in terms of achievability, it is easy to see that DoF^(2,3) > 
DoF*(2,2) = | which can be achieved by simply ignoring one of the receivers. Now the question is whether DoF^(2,3) is 
indeed the same as DoF^(2, 2) or the extra receiver can be exploited to achieve DoF beyond DoF^(2, 2). Here we propose an 
algorithm to show that DoF:[(2,3) > DoF*(2,2). 

The achievable scheme is as follows. Let u r , v r , w r , and tp r be four symbols for receiver r, r = A,B,C. The first phase 
of the scheme has 6 time-slots. The first two time-slots are dedicated to receiver A. In these two time-slots, the transmitter 
sends four random linear combinations of ua, va, wa, and ipA through the two transmit antennas. As a particular example, 
in the first time slot, the transmitter sends ua and va, and in the second time slot, it sends wa and ipA- Refer to Fig. @]for 
details. Similarly, in time-slots 3 and 4, the transmitter sends four random linear combinations of ub, vb, wb, and ips- In 
time-slots 5 and 6, the transmitter sends four random linear combinations of uc, vc, wc, and ipc- 

Referring to Fig. |4] we have the following observations: 

« Receiver A already has two independent linear equations Li(ua, va) and L^{wa, Vm) of ua, va, wa, and ipA- Therefore, 
it needs two more equations. 

> The four overheard equations in L2(ua,va), L^{uaiVa), L^wat^a), and Lq(wa,iPa) are not linearly independent 
from what receiver A has already received, i.e. Li(ua, va) and L^wa, VmJ- 

• We can purify the four overheard equations and form two equations that are linearly independent with Li(ua,va) and 
Li(wA, iPa)- For example, receiver B can form Li(ua, va, wa, ^a) as a random linear combination of L^t^A, va) and 
Ls(wa, iPa)- Similarly, receiver C can form L^(ua, va, wa, iPa) as a random linear combination of L^(ua,va) and 
Lq(wa, ^a)- The coefficients of these linear combinations have been preselected and shared among all nodes. 

• It is easy to see that almost surely, L 2 (ua, va, wa,4>a) and L 3 (ua, va, wa, ^Pa) are linearly independent of Li(ua, va) 
and L 4 (w A , iPa)- 

m If somehow we deliver Li(ua, va, wa, "*Pa) and L^(ua, va, wa, iPa) to receiver A, then it has enough equations to solve 
for ua, va, wa, and ipA- 

Similarly, as shown in Fig. we can purify the overheard equations in time-slots dedicated to receivers B and C. Now, the 
available side information and the requirements are the same as those we had after phase one for the case of M = K = 3 (see 
Subsection IIII-BI ). Equations L,2(ua, va, wa, iPa) and L^ua, va, wa, iPa) are available at receivers B and C, respectively, 
and are needed by receiver A, equations L4(ub,vb,wb,iPb) and Lq(ub,vb,wb,^b) are available at receivers A and C, 
respectively, and are needed by receiver B, and equations Lr(uc,vc,wc,ipc) and Lg(uc,vc,wc,ipc) are available at 
receivers A and B, respectively, and are needed by receiver C. We define 

u A b = L 2 {ua,va,w a ,^a) + L4(u b ,vb,wb,4>b), (32) 
u A c = L 3 (u A , v A , w A , iPa) + L 7 {u c , v c , w c , ipc), (33) 
ubc = L 6 (u b ,vb,w b ,tPb) + L 8 (uc,v c ,wc,ipc)- (34) 

Considering the available overheard equations at each receiver, one can easily conclude that uab is needed by both receivers 
A and B, uac is needed by both receivers A and C, and ubc is needed by both receivers B and C. The transmitter needs 



DoF* 3 (2 3) time-slots t° deliver these three order-two symbols, where according to Theorem Q] DoF2(2,3) = |. In summary, 
phase one starts with 12 order-one messages, takes 6 time-slots, and generates 3 order-two symbols. Therefore, we achieve 

12 _ 24 
~ 17' 



DoF x (2,3) 



6 + 



(35) 



DoF*(2,3) 



which is strictly greater than DoF^(2, 2) = |. Therefore, the proposed achievable scheme exploits the extra receiver to improve 
DoF. However, we notice that the achieved DoFi(2, 3) of || is still less than | = || which is suggested by the outer-bound. 
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Fig. 4. A Sub-Optimal Scheme for M = 2 and K = 3, The First Phase 



B. General Proof for Theorem 

Here, we explain a general version of the proposed algorithm. Again the algorithm includes K — j + 1 phases. Phase j 
takes symbols of order j (meaning that it is needed by j receivers simultaneously), and generates symbols of order j + 1. For 
j = K, the phase is simple and generates no more symbols. 

Let us define qj as 

qj = min{M -l,K-j}. (36) 
In addition, we define iy as the greatest common factor of qj and K — j, i.e. 

r h =gcf{q 3 ,K -j}. (37) 

Phase j takes (K — j) (^) symbols of order j and yields symbols with order j + 1. This phase has (^) 

sub-phases, where each sub-phase is dedicated to a subset S of the receivers, \S\ — j. The sub-phase dedicated to subset S 
is denoted by S-Ph(5). Each sub-phase takes time-slots. In S-Ph(5), the transmitter sends random linear combinations 
of & = symbols us.i,us,2, ■ ■ • ,«s,ft, desired by all receivers in S. The transmitter uses at least q 3 + 1 of the 

transmit antennas. The linear equation of the transmitted symbols received by receiver r, in the t-th time slot of S-Ph^), is 
denoted by L$ tr (t). Let us focus on the equations of the transmitted symbols received by all receivers in S-Ph(5). We have 
the following observations: 

• For every r, r G <S, and t, t 6 {1, 2, . . . , -^}> the K — j + 1 equations r' G {r} U £ \5} are not necessarily 
linearly independent. The reason is that \{r} Li£ \S\ = K — j + 1, while the number of transmit antennas is M which can 
be less than K — j + 1. Indeed, among the K — j overheard equations Lgy (t), r' G £\S, we can only form qj overheard 
equations that are simultaneously useful to receiver r, for any r in S. Therefore, among > ~ J> overheard equations in 
S-Ph(£>), we can form only ^^^^ overheard equations that are useful for any receiver r, r G S. 

• We purify the overheard linear combinations. To this end, receiver r', r' G £\S, forms 2l linear combinations of Ls, r '(t), 
t=l,..., ^7^. The resultant equations are denoted by Lg L$ ,r'(2), • • • , Ls.r' (f 1 ) ■ The coefficients of the linear 
combinations have been preselected and shared among all nodes. It is easy to see that for every r, the following ^ +1 ^ K ll 
equations are linearly independent: Ls_ r (t), t = 1, . . . , and Ls, r '{i), r 1 G £ \S and i G 1, . . . , Therefore, if we 
somehow deliver Ls r >(i), r' G £ \S and i G 1, . . . , ^ to receiver r, 7- G S, then it will have j3j — < ' q3+1 ^ K — linearly 
independent equations to solve for all desired symbols Us, l, us, 2, ■ ■ ■ ,us,/3,-. 



• Having the above two observations, we note that the purified linear combinations by receiver r', r' £ S\S, are simulta- 
neously useful for all receivers in <S. 

After repeating the above transmission for all S, where S C £ and |<S| = j, then we have another important property. 
Consider a subset T of the receivers, where \T\ = j + 1. Then each receiver r, r £ T, has purified linear combination 
LT\{r},r{t)> t = 1, • • • j which are simultaneously useful for all receivers in T\{r}. We note that the transmitter is aware 
of these purified equations through delayed CSIT. For every T C £ , |7~| = j + 1, the transmitter forms j^- random linear 
combinations of Lf\i r \ tr (t), r £ T, t = 1, . 



We note that Uq~£, 1 < £ < j 



2i, denoted by u T ,i,u T , 2 , ■ ■ ■ ,u Td ^_. , „ 

is simultaneously useful for all receivers in T. The reason is that each receiver r, r £ T, can subtract the contributions of 



-^T\{-r},-r(^)> £ = !)•••) ^> from tt"r,<;> £ = and form j'^- linearly independent combinations of L-]-\{ r >y ir i(t), 

r' G T\{r}, t = 1, . .., Therefore, using the above procedure, the transmitter forms _7^(-^ 1 ) symbols with order j + 1. 



The important observation is that if these 



symbols are delivered to the designated receivers, then each receiver will 
have enough equations to solve for all designated messages with order j. 



In summary, this phase takes (K — j) 9 '^ (^) symbols of order j, takes ^r^( ^) time-slots, and yields j-^Q^-J symbols 
of order j + 1. If we have a scheme which achieves DoF J+ i(M, K) for order-(j + 1) symbols, then we achieve DoFj(M, K), 



DoFj(M, if) = 



(^-J)^(i) 

rjj V j / + DoF J + 1 (M,K) 



or 



5; 



j DoFj(M, K) 



1j 



j + lDoF j+ i(M,K)' 



(38) 



(39) 



VII. Improved Scheme for M = 2 

Recall that the scheme of Section PVTl achieves DoF!(2,3) of |4. The achieved DoF is greater that DoF*(2,2) = |, which 
shows that we could exploit the extra receiver with respect to the number of transmit antennas. However, it is still smaller than 
| which is suggested by the outer-bound. Now the question is whether the achievable scheme or the outer-bound is loose. 

In what follows, we show that for M = 2 and K = 3, the outer-bound is tight and the achievable scheme of Section [VI] is 
loose. Before that, we explain an alternative solution for a system with M = K = 2. The idea of the alternative solution is 
the key to achieve the optimal DoF for the systems with M = 2 and K = 3. 

A. Alternative Scheme for M = K = 2 

Phase one of the algorithm takes order-one messages. Let us assume that the transmitter has ua and va for receiver A and 
ub and vb for receiver B. Here, phase one takes only one time-slot which is dedicated to both receivers. In this time-slot, the 
transmitter sends random linear combinations of all four symbols ua and va, ub, and vb- Refer to Fig.|5]to see the details of 
particular examples for the linear combinations. Receiver A receives a linear combination of all four symbols. We denote this 
linear combination by Lx(ua, va) + L^^ubjVb), where L\(ua, va) represents the contribution of ua and va, and L^(ub,vb) 
represents the contribution of ub and vb- Similarly, receiver B receives a linear combination of all four symbols denoted by 
L 2 (ua,v a ) + L4(u B ,v B )- 



ua + ub 



VA + Vb 



m = 2 m = 3 



L 2 (ua,va] 



L3(u b ,Vb) 




m = 1 


m = 2 


m = 3 


Li{u A ,v A ) +L ;i {uB;V B ) 




hAl\2}L 2 (uA,VA) 




hAi[3]L 3 (uB,v B ) 








L2{u A ,v A ) + L t (uB, v B ) 




h B l[2]L2(uA;VA) 




hBi[3]L 3 (uB,v B ) 



Fig. 5. Alternative Achievable Scheme for M = K = 2 

Then, we have the following observations: 



• If we somehow give L$(ub, vb) to receiver A, then receiver A can compute Li(ua, va) by subtracting Ls(ub,vb) from 
what it already has. Then if we also give L 2 (ua,va) to receiver A, then it has two equations to solve for ua and va- 

• If we somehow give L 2 (ua,va) to receiver B, then receiver B can compute L 4 (ub,vb) by subtracting L 2 (ua, va) from 
what it already has. Then if we also give L^(ub, vb) to receiver B, then it has two equations to solve for ub and vb- 

In other words, both receivers A and B want L2(ua,va) and L^(ub,vb)- Therefore, we can define two order-two symbols 
uab and vab as 

Uab = L 2 (ua,v a ), (40) 
v A B = L 3 (u B ,v B )- (41) 

In summary, this phase starts with 4 order-one symbols, takes one time-slot, and provides two order-two symbols. Two 
order-two symbols take DoF * 2 ( 2 2) time-slots to deliver. Therefore, we achieve 

DoFi(2,2)= 4 2 ■ (42) 

+ DoF5(2,2) 

Since DoF2(2,2) = 1, this scheme achieves DoF*(2,2) = |. 



B. Optimal Scheme for M — 2 and K = 3 

Here, we explain an algorithm for the systems with M = 2 and K = 3. The first phase of this algorithm takes 12 order-one 
messages, takes 3 time-slots, and gives 6 order-two symbols. This sub-algorithm leads to an optimal scheme for systems with 
M = 2 and K = 3. 

Let u r , v r , ity, and ip r be four symbols for receiver r, r = A, B, C. In the first time slot, which is dedicated to receivers 
A and B, the transmitter sends random linear combinations of four symbols ua and va, ub, and vb- Refer to Fig. [6] to see 
the details of particular realizations for the linear combinations. Receiver A receives a linear combination of all four symbols 
denoted by Li(ua,va) + L 4 (ub,vb)- Receivers B and C also receive linear combinations of all four symbols denoted by 
L 2 (ua> va) + L^{ubi v b) and L 3 (ua, va) + Lq(ub, Vb), respectively. In the second time slot, which is dedicated to receivers 
A and C, the transmitter sends random linear combinations of four symbols wa and ipA, uc, and vc- In the third time slot, 
which is dedicated to receivers B and C, the transmitter sends random linear combinations of four symbols wb, ipB, Wc, and 
i>c- 

By referring to Fig. [6] it is easy to see that for each receiver to solve for all four desired symbols, it is enough that 
. receiver A has L 2 (u A ,v A ), L4(u B ,v B ), L 9 (w a ,iPa), and L 10 (u c ,v c )- 
. receiver B has L 2 (u A ,v A ), L 4 (u B ,v B ), L 15 (w b ,^b), and L 17 (w c ,ipc)- 

• receiver C has L 9 (w A , 4>a), L 1Q (u c ,v c ), L 15 (w b ,^b), and L 17 (w c ,ipc)- 
Therefore, the transmitter needs to deliver 

• L 2 (ua,va) and L 4 {ub,Vb) to both receivers A and B. 

• L^wm, 4>a) and Lw(uc, vc) to both receivers A and C. 

• Li5(wb,iPb) and Lyj(wc,ipc) to both receivers B and C. 
Therefore, we have 6 order-2 symbols as 

uab = L 2 (u A ,v A ), v A B = L 4 (u B ,v B ), (43) 

uac = L 9 (w a ,iPa), vac = L 10 (u c ,vc), (44) 

ubc = Lx 5 (wb,iPb), v B c = Li 7 (w c ,ipc)- (45) 
Therefore, the transmitter needs DoF * 6 ( 2 3) more time-slots to deliver these 6 order-two symbols. Thus, we have 

DoFi(2,3)= - ■ 12 6 =~ (46) 

6 + DoFJ(2,3) 

where we used Theorem[T]to set DoF2(2,3) = |. Note the outer-bound in Theorem [2] yields DoF^(2,3) < |, and therefore, 
this algorithm meets the outer-bound. This result shows that the scheme of Section [Vl] is in general suboptimal. 
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Fig. 6. Optimal Scheme for a System with M = 2 and A" = 3, The First Phase 



VIII. Connections with the Packet Erasure Broadcast Channel 

The schemes we proposed in this paper are inspired by schemes designed for the packet erasure broadcast channel, where 
each receiver observes the same transmitted packet but with a probability of erasure, and acknowledgement feedback is received 
by the transmitter from both receivers. Here, the delayed CSI that is fed back to the transmitter is the erasure states of the 
previous transmissions. 

The goal of these packet erasure broadcast schemes is to exploit the fact that a packet intended for a receiver may be erased 
at that receiver but received at other receivers. These overheard packets become side information that can be exploited later. The 
basic scheme, initially proposed by [10] for unicast setting, and then by ifTTl for multicasting setting, in the two-receiver case, 
works as follows. The transmitter sends packets intended for each receiver separately. If a packet is received by the intended 
receiver, then no extra effort is needed for that packet. But if a packet is received by the non-intended receiver, and not received 
by intended receiver, that receiver keeps that packet for later coding opportunity. Let us say packet xa intended for receiver 
A is received by receiver B, and packet xb intended for receiver B is received by receiver A. In this case, the transmitter 
sends (xa XOR xb)- Then if receiver A receives it, it can recover xa by subtracting xb, and if receiver B receives it, it can 
recover xb by subtracting xa- In lfl2l . the outer-bound of ifPJI is used to show that the scheme of IfTTl is optimal. In fl4l . 
lfl5l . this two-receiver scheme is extended to more than two receivers, when all receivers have identical erasure probability. 
The scheme we proposed in this paper for the MIMO broadcast channel can be viewed as the counterpart to this scheme for 
the packet erasure broadcast channel. 

IX. Follow-up Results 

After the conference version of this paper has appeared in lfl6l . the problem of exploiting outdated CSIT in networks have 
been investigated in several pieces of work. In 11171 . it is shown that for three-user interference channels and two-user X 
channels, outdated CSIT can be used to achieve DoF more than one. In [18|, for two-user X channels, the result of IfTTl has 
been improved and for three-user case, an achievable DoF has been proposed. In |19|, an achievable DoF for Jv-user single- 
antenna interference channels has been derived. In [20|-[22|, the DoF regions of two-user and three-user MIMO broadcasts 
channels and two-user MIMO interference channels with delayed CSIT are studied. In [23], the load of feedback to implement 
the proposed scheme is evaluated. It is shown that for a wide and practical range of channel parameters, the scheme of this 
paper outperforms zero-forcing precoding and also single-user transmission. 

X. Conclusions 

From the point of view of the role of feedback in information theory, this work provides yet another example that feedback 
can be useful in increasing the capacity of multiuser channels, even when the channels are memoryless. This is in contrast 
to Shannon's pessimistic result that feedback does not increase the capacity of memoryless point-to-point channels lf24l . In 
the specific context of broadcast channels, Ozarow lfT3l has in fact already shown that feedback can increase the capacity 
of Gaussian scalar non-fading broadcast channels. However, the nature of the gain is unclear, as it was shown numerically. 
Moreover, the gain is quite limited. We argue that the MIMO fading broadcast channel considered in this paper provides a much 
more interesting example of the role of feedback. The nature of the gain is very clear. In contrast to the Gaussian scalar non- 
fading broadcast channel, the main uncertainty from the point of the view of the transmitter is the channel direction rather than 



the additive noise, particularly in the high SNR regime. This means that although the MIMO channel has intrinsically multiple 
degrees of freedom, the transmitter cannot segregate it into multiple orthogonal channels, one for each receiver. Hence, when 
transmitting information for one receiver, significant part of that information is overheard at other receivers. This overheard 
information becomes side information that can be exploited in future transmissions. The role of feedback is to provide the 
channel directions to the transmitter after the transmission to allow the transmitter to determine the side information that was 
received at the receivers. Overall, feedback leads to a much more efficient use of the intrinsic multiple degrees of freedom in 
the MIMO channel, yielding a multiplexing gain over the non-feedback case. 



Appendix A 
An Identity 

In this appendix, we prove that for any j, 1 < j < K, 

K-j+l (K-i\ K 1 

(±7 £ ¥ = £t- (47 > 

V?— 1/ i — 1 i—j 

We define LHS of <g7) as f{j), 

K-j+l (K-i\ 

/(i) = T 4 T E 

Then it is easy to see that f(K) = -p. In what follows, we prove that for any j, 1 < j < K — 1, 



which yields identity ( |47| i. 
We have 
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where (a) follows from the identity that 



i= P 

Equation ( f49T > can simply be proved by induction 



(48) 
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